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Abstract. Let G be a finitely generated group, A a finite set of generators and K a 
' subgroup of G. We call the pair (G, K) context-free if the set of all words over A that 

reduce in G to an element of K is a context-free language. When K is trivial, G itself 
is called context-free; context-free groups have been classified more than 20 years ago in 
celebrated work of Muller and Schupp as the virtually free groups. 

Here, we derive some basic properties of such group pairs. Context-freeness is indepen- 
dent of the choice of the generating set. It is preserved under finite index modifications 
of G and finite index enlargements oi K . If G is virtually free and K is finitely generated 
then (G, K) is context-free. A basic tool is the following: (G, K) is context-free if and 
only if the Schreier graph of (G, K) with respect to A is a context-free graph. 

^ ; 1. Introduction 

Let G be a finitely generated group and K a subgroup (not necessarily finitely gener- 
ated). We can choose a finite set A C G of generators such that every element of G is 
of the form g = g\ ■ ■ ■ g n , where n > and gx, . . . , g n G A. Thus, A generates G as a 
semigroup. We shall say that (G, K) is context-free, if - loosely spoken - the language of 
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The precise definition needs some preparation. Let S be a finite alphabet and if) : S — > 
G be a (not necessarily injective) mapping such that A = if)(E) satisfies the above finite 
generation property for G. Then if) has a unique extension, also denoted if), as a monoid 
homomorphism if) : S* —>■ G. Recall that S* consists of all words w — a\ ■ ■ ■ a n , where 
n > and ai, . . . ,a n G S (repetitions allowed). The number n is the length \w\ of w. If 
n = this means that w = e, the empty word. This is the neutral element of XI*, and S* 
is a free monoid with the binary operation of concatenation of words. The extension of 
ijj is of course given by 

ip(ai ■ ■ -a n ) = ip(ai) ■ ••ip(a n ) , 

where the product on the right hand side is taken in G. Given these ingredients, we 
shall say that ip : S — >• G is a semigroup presentation of G, referring to the fact that A 
generates G as a semigroup. A language over S is a non-empty subset of S*. 

(1.1) Definition. The word problem of (G,K) with respect to if) is the language 

L(G, K, if)) = {w G S* : if){w) G K} . 
We say that the triple (G, K, if)) is context-free, if L(G, K, if)) is a context-free language. 



Date: October 31st, 2009. 

2000 Mathematics Subject Classification. 20F10, 68Q45, 05C25 . 

Key words and phrases. Finitely generated pair of groups, word problem, context-free language, push- 
down automaton, context-free graph. 

The first author was partially supported by a visiting professorship at TU Graz. The second author 
was partially supported by a visiting professorship at Universita di Roma - La Sapienza and the Austrian 
Science Fund project FWF-P19115-N18. 

1 



2 



T. Ceccherini-Silberstein and W. Woess 



A context-free grammar is a quadruple C = (V, X,P } £), where V is a finite set of 
variables, disjoint from the finite alphabet S (the terminal symbols), the variable S is the 
start symbol, and P C V x (V U S)* is a finite set of production rules. We write Thu 
or (T h u) G P if (T, w) G P. For v,w & (V U S)*, we write t> ==>- u> if v — v\Tv2 and 
w = V1UV2, where u, V\,Vi G (VUS)* and T \- u. This is a single derivation step, and it is 
called rightmost, if v 2 G £*. A derivation is a sequence v — Wq, Wi, . . . , — w G (VUS)* 
such that ==>- ; we then write v ==>• it). A rightmost derivation is one where 

each step is rightmost. The succession of steps of any derivation T ==^> w G S* can be 
reordered so that it becomes a rightmost derivation. For T G V, we consider the language 
Lt = {w G £* : T w}. The language generated by C is L(C) = L$- 

A context-free language is a language generated by a context-free grammar. As a basic 
reference for Language and Automata Theory, we refer to the magnificent monograph of 
Harrison [5]. 

The above definition of a context-free pair, or rather triple, (G, K, ip) makes sense when 
G is a finitely generated monoid and if is a sub-monoid, but here we are interested in 
groups. When in addition K = {1g} ; this leads to the notion of G being a context-free 
group. In two celebrated papers, Muller and Schupp pU], [H] have carried out a 
detailed study of context-free groups and more generally, context-free graphs. In particu- 
lar, context-freeness of a group is independent of the particular choice of the generating 
set A of G. The main result of [10], in combination with a fundamental theorem of 
Dunwoody [3], is that a finitely generated group is context-free if and only if it is vir- 
tually free, that is, it contains a free subgroup with finite index. (In [ID] , it is assumed 
that A = A' 1 and that ip : S — ► A = is one-to-one, but the results carry over 

immediately to the more general setting where those two properties are not required.) 

Previously, Anisimov |lj had shown that the groups whose word problem L(G, {1g} 5 
is regular (see §|2]for the definition) are precisely the finite groups. 

The abovementioned context-free graphs are labelled, rooted graphs with finitely many 
isomorphism classes of cones. The latter are the connected components of the graph that 
remain after removing a ball around the root with arbitrary radius. See §H for more 
precise details. As shown in [11], there is a natural correspondence between such graphs 
and pushdown automata, which are another tool for generating context-free languages; 
see §3] 

Among subsequent work, we mention Pelecq [12] and Senizergues [15], who studied 
actions on, resp. quotients of context-free graphs. Group-related examples occur also in 
Ceccherini-Silberstein and Woess [3]. 

More recently, Holt, Rees, Rover and Thomas [6] have introduced and studied 
co- context-free groups, which are such that the complement of L{G, {1g} ; "0) is context- 
free, see also Lehnert and Schweitzer [8]. This concept has an obvious extension 
to co-context-free pairs of groups, resp. graphs, on whose examination we do not (yet) 
embark. 

In the present notes, we collect properties and examples of context-free pairs of groups 



(G,K). 
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• The language L(G, K,tp) is regular if and only if the index [G : K\ of K in G is 
finite (Proposition 12. 5p . 

• The property that L(G, K, ip) is context-free does not depend on the specific choice 
of the semigroup presentation ip, so that context-freeness is just a property of the 
pair (C7, K), a consequence of Lemma [3.11 

• If (G, K) is context-free then L(G, K, is a deterministic context-free language 
(see $3j for the definition) for any semigroup presentation tp : S — ► G (Corollary 
UTTOla). 

• If (C7, K) is context-free and H is a finitely generated subgroup of C7, then the pair 
(H, K H if) is context-free (Lemma 13. II) . 

• If [G : f/] < oo then (C7, K) is context-free if and only if (H, KnH) is context-free 
(Proposition I3.3I fc Lemma [4. lip . 

• If (G, K) is context-free and H is a subgroup of G with K < H and [H : K] < oo 
then (G,H) is context-free (Lemma 14.11ft . 

• If K is finite then G is context-free if and only if (G, K) is context-free (Lemma 

E3D. 

• If (C7, .fT) is context-free then (C7, g~ x Kg) is context-free for every <? E C7 (Corollary 
ETTDlb). 

• If G is virtually free and K is a finitely generated subgroup of G then (C7, K) is 
context-free (Corollary 15.6ft . 

Several of these properties rely on the following. 

• A fully deterministic, symmetric labelled graph (see £j2]for definitions) is context- 
free in the sense of Muller and Schupp if and only if the language of all words 
which are labels of a path that starts and ends at a given root vertex is context- 
free (Theorems 14.21 and 14. Tf) . 

The (harder) "if" part is not contained in previous work. It implies the following. 

• The pair (C7, K) is context-free if and only if for some ( <^=^ any) symmetric 
semigroup presentation ip : S — > G, the Schreier graph of (C7, K) with respect to 
ip is a context-free graph. (See again £j2]for precise definitions). 

In a second paper [18J, a slightly more general approach to context-freeness of graphs 
via cuts and tree-sets is given. It allows to show that certain structural properties ("ir- 
reducibility" ) are preserved under finite- index- modifications of the underlying pair of 
groups. This is then applied to random walks, leading in particular to results on the 
asymptotic behaviour of transition probabilities. 

In concluding the Introduction, we remark that with the exception of some "elemen- 
tary" cases, context-free pairs of groups are always pairs with more than one end. Ends 
of pairs of groups were studied, e.g., by Scott [Hj . Swarup [16] and Sageev [15] . In 
particular, the interplay between context-freeness of pairs and decomposition as amalga- 
mated products or HNN-extensions needs still to be explored. 

Acknowledgement. We are grateful to Wilfried Imrich and Rognvaldur G. Moller for 
useful hints and discussions. 
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2. SCHREIER GRAPHS, AND THE REGULAR CASE 

Let E be a finite alphabet. A directed graph labelled by S is a triple (X, E, £), where X 
is the (finite or countable) set of vertices, BcIxSxIis the set of oriented, labelled 
edges and £ : E 3 (x , a, y) a <E H is the labelling map. 

For an edge e = (x, a, y) G E, its initial vertex is e~ = x and its terminal vertex is 
e + = y, and we say that e is outgoing from re and ingoing into y. If y = x then e is a 
/cop, which is considered both as an outgoing and as an ingoing edge. We allow multiple 
edges, i.e., edges of the form e\ = (x, Oi, y) and 62 = (x, 02, y) with a\ 7^ a<i, but here we 
exclude multiple edges where also the labels coincide. The graph is always assumed to 
be locally finite, that is, every vertex is an initial or terminal vertex of only finitely many 
edges. We also choose a fixed vertex o E X, the root or origin. We shall often just speak 
of the graph X, keeping in mind the presence of E and £. 

We call X fully labelled if at every vertex, each a G S occurs as the label of at least one 
outgoing edge. We say that X is deterministic if at every vertex all outgoing edges have 
distinct labels, and fully deterministic if it is fully labelled and deterministic. Finally, 
we say that X is symmetric or undirected if there is a proper involution a 1— > a -1 of 
£ (i.e., (a -1 ) -1 = a, excluding the possibility that a" 1 = a) such that for each edge 
e = (x, a, y) G E, also the reversed edge e" 1 = (y, a -1 , x) belongs to E. 

A path in A" is a sequence 7r = e\C2 . . . e n of edges such that ef = e~ +1 for i — 1, . . . , n— 1. 
The vertices 7r~ = ej~ and 7r + = e+ are the initial and the terminal vertex of 7r. The 
number | vr j = n is the length of the path. The label of 7r is £(-7r) = £(e 1 )£(e 2 ) ■ ■ • £(e n ) G £*. 
We also admit the empty path starting and ending at a vertex x, whose label is e. Denote 
by U XtV = U Xi y(X) the set of all paths n in X with initial vertex n~ = x and terminal 
vertex ty + = y. The following needs no proof. 

(2.1) Lemma/Definition. Let (X,E,£) be a labelled graph, x G X and w G S*. We 

define H x (w) = {n : tt~ = x , £(7r) = w}, the set of all paths that start at x and have label 
w. The set of all terminal vertices of those paths is denoted x w = {n + : n G U x (w)}. 

Analogously, we define H x (w) = {tv : n + = x , £(tt) = w}, the set of all paths that 
terminate at x and have label w, and write x~ w = {tt~ : n G H x (w)}. 

If X is fully labelled, then U x (w) is always non-empty. 

If X is deterministic, then H x (w) has at most one element, and if that element exists, 
it is denoted 7r x (w), while x w just denotes its endpoint. 

If X is fully deterministic, then x w is a unique vertex of X for every x G X, w G S*. 

Finally, if X is symmetric (not necessarily deterministic), then H x (w) = ir^-u;" 1 ), 
where for w — a\ • • ■ a n , one defines w' 1 = a' 1 ■ ■ ■ a^ 1 . 

With a labelled, directed graph as above, we can associate various languages. We can, 
e.g., consider the language 

(2.2) (X) = {£(tv) : 7T G U Xty (X)} , where x, y G X . 

(2.3) Definition. Let G be a finitely generated group, K a subgroup and : Y, ^> G a 
semigroup presentation of G. The Schreier graph X = X(G, K, ip) has vertex set 

X — K\G — {Kg : g G G} 
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(the set of all right K-cosets in G), and the set of labelled, directed edges 

E = {e = (x, a, y) : x = Kg , y = Kgt/j(a) , where g G G , a G £} . 

X is a rooted graph with origin o = K, the right coset corresponding to the neutral 
element 1q of the group G. The Schreier graph is fully deterministic. It is also strongly 
connected: for every pair x, y G X, there is a path from x to y. (This follows from the 
fact that ^(S) generates G as a semigroup.) When K = {1g} then we write X(G, if)). 
This is the Cayley graph of C7 with respect to if), or more loosely speaking, with respect 
to the set ^(S) of generators. 

Note that X can have the loop e = (x, a,x) £ E with x = Kg. This holds if and only if 
if){a) G g~ x Kg. It can also have the multiple edges e\ = (x, ai, y) and e 2 = (x, a 2 , y) with 
x = Kg and ai 7^ a 2 . This occurs if and only if if)(a2)if){a\)~ l G g~ x Kg. In particular, 
there might be multiple loops. The following is obvious. 

(2.4) Lemma. Let K be a subgroup of G and if) : S — *> G be a semigroup presentation of 
G. Then 

L(G, K,ij) = L 0t0 (X) 

is the language of all labels of closed paths starting and ending at o = K in the Schreier 
graph X(G,K,if>). 

A context-free grammar C = (V ,S,P , S) and the language L(C) are called linear, if 
every production rule in P is of the form T h v\Uv2 or T h v , where v , Vi,v% G S* and 
T,U G V. If furthermore in this situation one always has i> 2 = e (the empty word), then 
grammar and language are called n'<?/ii linear or regular. 

A finite automaton A consits of a finite directed graph X = (X, E, £) with label set 
S and labelling map £, together with a root vertex o and a nonempty set F G X. The 
vertices of X are called the states of A, the root o is the initial state, and the elements of 
F are the final states. The automaton is called (fully) deterministic provided the labelled 
graph X is (fully) deterministic. The language accepted by A is 

L(A) = |J L ^{X) . 

If ^4 is deterministic, then for each w G -^(^4) there is a unique path tt G [j x( z F ^o,x{X) 
such that £(7r) = w . A state y E X is called useful if there is some word wel such that 
the vertex y lies on a path in (J^p^ 7c 0tX (X) with label «;. It is clear that we can remove all 
useless states and their ingoing and outgoing edges to obtain an automaton which accepts 
the same language and is reduced: it has only useful states. It is well known [3, Chapter 
2] that a language L C £* is regular if and only if L is accepted by some deterministic 
finite automaton. 

The following generalizes Anisimov's [1] characteriziation of groups with regular word 
problem, and also simplifies its proof, as well as the simpler one of jTOj Lemma 1]. 

(2.5) Proposition. Let G be a finitely generated group, K a subgroup and -0 : S — > G a 

semigroup presentation of G. Then (G,K) has regular word problem with respect to if) if 
and only if K has finite index in G. 
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Proof. Suppose first that the index of K in G is finite. Consider the finite automaton 
A = (X, o, {o}) where X is the Schreier graph X(G, K, ijj), and the initial and unique 
final state is o = K (as a vertex of X). Then L(G, K, tp) = L(A): indeed, w G S* belongs 
to L(G,K,ifj), i.e. i/j(w) G K, if and only if K = Kif)(w). This shows that L(G, K,ifj) is 
regular. 

Conversely, suppose that L = L(G, K, if)) is regular and accepted by the reduced, 
deterministic finite automaton A = (X, o, F). For y G X there is some word w G L such 
that the vertex y lies on the unique path from o to F with label w. We choose one such 
w and let w y be the label of the final piece of the path, starting at y and ending at F. 
We set g y = ip^Wy)^ 1 G G. 

Let g G G. There are w,w G £* with ^(u>) = g and ^(uJ) = gr -1 . Thus, ww G L = 
K,ip), and there is a (unique) path tt with label t/ruT from o to some final state. 
Now consider the initial piece ir w of tt, that is, the path starting at o whose label is our 
w that we started with. [Thus, we have proved that such a path tt w must exist in X !] 
Let y be the final state (vertex) of tt w . Then clearly ww y G L(A), which means that 
gg~ x = ipiwujy) G K. Since ^(S*) = G, it follows that 

G=[jKg y , 

y ex 

and K has finitely many cosets in G. 

□ 

(2.6) Corollary. Let G be finitely generated and K a subgroup. Then the property of the 
pair (G, K) to have a regular word problem is independent of the semigroup presentation 
ofG. 

We shall see that the same also holds in the context-free case. Another corollary that 
we see from the proof of Proposition 12.51 is the following. 

(2.7) Corollary. Let G be finitely generated and K a subgroup with finite index. Then 
for any semigroup presentation tfi : S — > G, any reduced deterministic automaton A = 
(X,o,F) that accepts L(G,K,i(j) has a surjective homomorphism (as a labelled oriented 
graph with root o) onto the Schreier graph X(G, K,ip). Also, the labelled graph X is fully 
deterministic. 

Proof. Let A = (X, o, F) be deterministic and reduced, as in part 2 of the proof of 
Proposition 12.51 

Let y G X, and recall the construction of the label w y of a path from y to F, and 
g y = ^(vjy)^ 1 G G. If v is another path from y to F, and h = ^(i>) -1 , then we can take 
w G L o y (which we know to be non-empty) and find that ww y , wv G L(G, K, ip), so that 
ip(w) G Kg y H Kh. Thus Kg y = Kifj(w) = Kh, and the map k : X — > K\G, y i— > Kg y is 
well defined. It has the property that when w G L ^ y , then Kip(w) = Kg y . The map k is 
clearly surjective, and k(o) = K by construction. 

Now let y G X and a G S. Take w G L Q/y and consider the word wa. Again by part 2 
of the proof of Proposition 12.51 there is a unique path n wa in X starting at o with label 
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wa. If y is its final vertex, then there is the edge e = (y, a, z) in X. In this situation, 
k(z) = Kif)(wa) = Kg y ip(a) = K,(y)ip(a). This means that in the Schreier graph, there 
is the edge with label a from n(y) to k(z). Therefore k is a homomorphism of labelled 
graphs. □ 

The following simple example shows that, in general, the map k constructed in the 
proof of the previous corollary is not injective. 

Let G = Z 2 = {l,t} be the group of order two and K = {1} the trivial subgroup. 
Let £ = {a} and consider the presentation -0: £ — > G such that ip(a) = t. Then 
L(G,K,if;) = {a 2n : n > 0}. 




Figure 1. 



In Figure 1 above we have represented, in order, the Schreier graph X(G,K,ip) (which 
is nothing but the Cayley graph of G w.r. to i/j), and two automata A\ and Ai- As 
usual o denotes the origin, while the sets of final states are F\ = {o} and F 2 = {o, /}, 
respectively. We have L(Ai) = L(A 2 ) = L(G,K,ip). 



3. Pushdown automata 

Besides grammars, we shall need another instrument for generating context-free lan- 
guages. A pushdown automaton is a 7-tuple A = (Q, S, Z, 8, qo, Q/, zq), where Q is a 
finite set of states, S the input alphabet as above, Z a finite set of stack symbols, q e Q 
the initial state, Qj C Q the set of final states, and Zq e Z is the start symbol. Finally, 
the function S : Q x (S U {e}) x (Z U {e}) — > "Pfi n (Q x Z*) is the transition function. Here, 
7 ? fin(Q x Z*) stands for the collection of all finite subsets of Q x Z*. 

The autmaton works in the following way. At any time, it is in some state p G Q, and 
the stack contains a word (gZ*. The automaton reads a word w G S* from the "input 
tape" letter by letter from left to right. If the current letter of w is a, the state is p and 
the top (=rightmost) symbol of the stack word £ is z, it performs one of the following 
transitions. 

(i) A selects some (q, (') G 5(p, a, z), changes into state q, moves to the next position on 
the input tape (it may be empty if a was the last letter of w), and replaces the rightmost 
symbol z of £ by or 
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(ii) A selects some (q,C) e d(.P> e i z ), changes into state q, remains at the current 
position on the input tape (so that a has to be treated later), and replaces the rightmost 
symbol z of £ by 

If both 8(p, a, z) and 8(p, e, z) are empty then A halts. 

The automaton is also allowed to continue to work when the stack is empty, i.e., when 
C = e. Then the automaton acts in the same way, by putting (' in the stack when it has 

selected (g, CO d(p, a > e ) in case (i)> resp. (g, CO ^(p> e > e ) i n case (ii)- 

We say that A accepts a word w G S* if starting at the state go with only zq in the 
stack and with w on the input tape, after finitely many transitions the automaton can 
reach a final state with empty stack and empty input tape. The language accepted by A 
is denoted L(A). 

The pushdown automaton is called deterministic if for any p G Q, a G £ and z G ZU{e}, 
it has at most one option what to do next, that is, 

\8{p,a,z)\ + \8{p,e,z)\ < 1. 

(Here, | • | denotes cardinality.) 

It is well known [5] that a language is context-free if and only if it is accepted by some 
pushdown automaton. A context-free language is called deterministic if it is accepted by 
a deterministic pushdown automaton. We also remark here that a deterministic context- 
free language L is un- ambiguous, which means that it is generated by some context-free 
grammar in which every word of L has precisely one rightmost derivation. 

The following lemma is modelled after the indications of pm Lemma 2]. For the sake 
of completeness, we include the full proof. 

(3.1) Lemma. Suppose that G,K,~E and ip : S — ► G are as above. Let H be a finitely 
generated subgroup of G, and let S' be another alphabet and i/j' : £' — > H be such that 
F' = ip'(£') generates H as a semigroup. 

Then, if L(G, K, ip) is context-free, also L(H, K(~)H, ip') is context-free, and if in addi- 
tion L(G, K, if;) is deterministic, then so is L(H, K (1 H, ip'). 

Proof. We start with a pushdown automaton A = (Q, S, Z, 8, q , Q/, Zq) that accepts 

For each b G S', there is u(b) G S* such that ift'(b) = ij)(u(b)), and we may choose u(b) 
to have length > 1. Thus, 

w ' = b x ■ ■ ■ b n G L(H, K n H, u(bi) ■ ■ ■ u{b n ) G L(G, K, . 

With this in mind, we modify A in order to obtain a pushdown automaton A' that 
accepts L(H, K Pi H, tp'). Our A' has to translate any w' — bi - • • b n G (S')* into w = 
u(bi) ■ ■ ■ u(b n ) G S* and to use A in order to check whether w G L(G, K, ip). 

Let m + 1 = max{|w(6)| : b G S'}. If m = then the only modification of A needed is 
to replace S by its subset S' and to use the resulting restriction of the transition function. 

Otherwise, we set S m = S U S 2 U • • • U S m . For v G £+ = S* \ {e}, we denote by 
v + its subword obtained by deleting the first letter. We define Q' = Q U (Q x S m ) and 
A' = (Qf, S', Z, 5', g , Q/, ^o) with the transition function 8' as follows. For each p G Q 
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and z G Z, 

S'(p,e,z) = S(p,e,z) , 

S'(p, b, z) = 8(p, a, zj , if u(b) = a G £ , 

tf'(p, = { ((<?, m(6)+), C) = (9, e 5(p, a, *)} , if u (6) G a£+ , 

e,z) = {((<?, i>),C) : (9,0 e *(p,e,z)} 

U {((q,v+),£) ■ (9,0 etf(p,a,*)}, if u G aS+ , 

<y 7 ((p,o),e, z) = {((<?, a), C) : (q,() e <*(p>e, z)} U5(p,a,z), if a G £ . 

Thus, the new states of the form (p, with 1 < \v\ < m serve to remember the ter- 
minal parts v of the words u(b), b G This automaton accepts L(G , K , -0') , and it is 
deterministic, if .4 has this property. □ 

(3.2) Corollary. Being context-free is a property of the pair (G,K) that does not depend 
on the specific choice of the alphabet £ and the map ip : £ — > G for which 0(£) generates 
G as a semigroup. 

Therefore, it is justified to refer to the contxt-free pair (G, K) rather than to the triple 
(G,K,if)). Furthermore, whenever this is useful, we may restrict attention to the case 
when the graph X(G, K, tp) is symmetric: we say that tp is symmetric, if there is a proper 
involution a i— > a -1 of £ such that ip{a~ l ) = ■0(a) -1 in G. (Again, it is not necessary to 
assume that ifi is one-to-one, so that we have that a -1 ^ a even when i/j(a) 2 = 1q.) 

(3.3) Proposition. Let G be finitely generated, H be a subgroup with [G : H] < oo. If 

K is a subgroup of H then (G, K) is context-free if and only if (H, K) is context-free. 

Proof. The "only if" is contained in Lemma [37lT (Observe that H inherits finite generation 
from G, since [G : H] < oo.) 

For the converse, we assume that (H, K) is context-free and let if) : £ — > H and 
if/ : £' —>■ G be semigroup presentations of and C7, respectively. There is a pushdown 
automaton A = (Q, £, Z, 5, q , Q/, z ) that accepts L(if, -PT, '0). 

Let F be a set of representatives of the right cosets of H in G, with 1^ G F. Thus, 
|F| < oo, and 

G= |+| Hg, 

g&F 

For every g a F and 6 6 S' there is a unique g = g(g, b) G F such that gtp'{b) G if g. 
Therefore there is a word w = u(g, b) G £* such that 

#'(&) = i>{u(g,b))g(g,b) . 

An input word u; = b\- • - b n is transformed recursively into U\ - ■ -u ni along with the 
sequence g , gi, . . . , g n of elements of F that indicate the current if-coset at each step: 

go = 1 G ; u k = u(g k - h b k ) and g k = g(gk-i, b k ) ■ 

Then ip'(w) G K if and only if g n = 1q and ip(ui ■ ■ ■ u n ) G K. 
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Thus, our new automaton A' recalls at each step the current coset Hg^-i , which is 
multiplied on the right by if)(bk), where b k is the next input letter. Then the new coset 
is Hg(g k -i, 6 fc ), and A' simulates what A does next upon reading u(gk-i, b k ). Then w is 
accepted when at the end the coset is H = HIq and A is in a final state. 

The simple task to write down this automaton in detail is left to the reader. □ 



4. Context-free graphs 

In this section, we assume that (X, E, £) is symmetric. We may think of each pair 
of oppositely oriented edges (x, a, y) and (y,a _1 ,x) as one non-oriented edge, so that X 
becomes an ordinary graph with symmetric neighbourhood relation, but possibly multiple 
edges and loops. If it is in addition fully deterministic, then X is a regular graph, that 
is, the number of outgoing edges (which coincides with the number of ingoing edges) at 
each vertex is |S|. Attention: if we consider non-oriented edges, then each loop at x has 
to be counted twice, since it corresponds to two oriented edges of the form (x, a, x) and 
(x, a" 1 , x). For all our purposes it is natural to require that X is connected: for any pair 
of vertices x, y there is a path from x to y. The distance d(x,y) is the minimum length 
(number of edges) of a path from x to y, which defines the integer-valued graph metric. 
A geodesic path is one whose length is the distance between its endpoints. 

We select a finite, non-empty subset F of X and consider the balls B(F,n) = {x : 
d(x,F) < n} (where d(x,F) = mm{d(x,y) : y G F}). If we delete B(F,n) then the 
induced graph X \ B(F, n) will fall apart into a finite number of connected components, 
called cones with respect to F. Each cone is a labelled, symmetric graph C with the 
boundary dC consisting of all vertices x in C having a neighbour outside C (i.e., in 
B(F,n)). 

The following notion was introduced in [TT] for symmetric, labelled graphs and F = {o}. 

(4.1) Definition. The graph X is called context-free with respect to F if there is only 
a finite number of isomorphism types of the cones with respect to F as labelled graphs 
with boundary. 

This means that there are finitley many cones C\, . . . ,C r (generally with respect to 
different radii n) such that for each cone C, we can fix a bijection <pc from (the vertex set 
of) C to precisely one of the Cj , this bijection sends dC to dC iy and (x, a, y) is an edge 
with both endpoints in C if and only if its image (0c* (x), a, (j>c{y)) is an edge of Q. In 
this case, we say that C is a cone of type i. 

Generally, as in [TT], we are interested in the case when F = {o} (or any other singleton), 
but there is at least one point where it will be useful to admit arbitrary finite, non-empty 
F. 

Another natural notion of context-freeness of X with respect to o is to require that the 
language L 0j0 (X) is context-free. We shall see that for deterministic, symmetric graphs 
this is equivalent with context-freeness with respect to o in the sense of Definition 14. 11 
One direction of this equivalence is practically contained in [llj . but not stated explicitly 
except for the case of Cayley graphs of groups. The other direction (that context-freeness 
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of L 0i0 implies that of the graph) is shown in [TT] only for Cayley graphs of groups, which 
is substantially simpler than the general case treated below in Theorem 14.71 

(4.2) Theorem. If the symmetric, labelled graph (X, E, £) with label alphabet S is context- 
free with respect to the finite, non-empty set F C X , then L XyV is a context-free language 
for all x,y G X . Furthermore, if the graph X is deterministic, then so is the context-free 
language L x ^ y . 

Proof. Just for the purpose of this proof, we write x , y instead of x, y for the vertices for 
which L X(hyo will be shown to be context-free. We may assume without loss of generality 
that xq , i/o in F. Indeed, if this is not the case, then we can replace F by F' = B(F, n), 
which contains xq and yo when n is sufficiently large. The cones with respect to F' are 
also cones with respect to F, so that X is also context-free with respect to F' . 

Similarly to [TT1 Lemma 2.3], we construct a deterministic pushdown automaton that 
accepts L XQt y . 

We consider also the whole graph X as a cone C with boundary F, which we keep 
apart from the other representatives Ci, . . . , C r of cones. 

If C is a cone, then as a component of X\B(F, n) for some n > it must be a successor 
of another cone C~ . The latter is the unique component of X \ B(F, n — 1) that contains 
C, when n > 1, while it is Cq = X when n — 0. We also call C~ the predecessor of C. 

Different cones of type j G {1, . . . ,r} may have predecessors of different types. Con- 
versely, a cone C of type i G {0, . . . , r} may have none, one or more than one successors of 
type j, and the number of those successors depends only on i and j. In the represen- 
tative cone Ci , we choose and fix a numbering of the distinct successors of type j as C\^ , 
k — 1, . . . , dij . If C is any cone with type i then we use the isomorphism <pc '■ C — ► Ci 
to transport this numering to the successors of C that have type j, which allows us to 
identify the k-th successor of C with type j. 

One can visualize the cone structure by a finite, oriented graph T with multiple edges 
and root 0: the vertex set is the set of cone types i G {0, . . . , r}, and there are oriented 
edges, which we denote by tfj (k = 1, . . . , dy) from vertex % to vertex j (i > 0, j > 1). 

Every vertex x of X belongs to the boundary of precisely one cone C = C(x) with 
respect to F. We define the type i of x as the type of C(x). Under the mapping 4>c our 
x corresponds to precisely one element of 9Cj. We write (f>(x) for that element, without 
subscript C, so that 4> maps X onto [J { dCi . In particular, <p(x) = x for every x G F. 

Let j/Gl\F with type j. Then there is % (depending on y) such that every neighbour 
x of y with d(x, F) = d(y, F) — 1 has type i, and there is precisely one successor cone Cfj 
of Ci that contains 4>c{x){v)- in this case, we write r(y) = t\^ , the second order type of y. 
Compare with [TT]. If y' is such that C(y') = C(y) then r(y') = r(y). 

We now finally construct the required pushdown automaton A. (Comparing with [TT] . 
we use more states and stack symbols, which facilitates the description.) The set of states 
and stack symbols are 

r 

Q = |+) ^67^ and Z = Fu{tJ j :z = l,...,r, j = 0, ...,r, k = 1, . . . , d id } . 

i=0 
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(When dij = then there is no •.) Note that both sets contain F. In order to generate 
the language L xo>yo , where x ,y G F, then we use x as the initial state and y as the 
(only) final state. We describe the transition function, which - like Q and Z - does not 
depend on xo,yo ■ 

We want to read an input word, which has to correspond to the label starting at xq. 
Inside the subgraph of X induced by F, our A behaves just like that subgraph, seen as a 
finite automaton. 

Outside of F, it works as follows. At the m-th step, the automaton will be in a state 
that descibes the m-th vertex, say x, of that path, by identifying x as above with the 
element (p(x) of Cj, where j is the type of x. The current stack symbol is of the form 
and serves to recall that x lies in the k-th successor cone of type j of a cone with type i. 
If the next vertex along the path, say y, satisfies d(y, F) = d(x, F) + 1, and y has type 
f then the state is changed to (f>{y) G Cj> , and the symbol ^ •, = r(y) is added to the 
stack. If d(y,F) = d(x,F), then only the state is changed from <p{x) to <p(y). Finally, if 
d(y, F) = d(x, F) — 1 then the new state is again 4>{y) , while the top symbol in the stack 
is deleted. Formally, we get the following list of transition rules. 

IfxGF = QnZ : 

5{x,a,x) = {(y,y) : (x, a,y) EE, y g F} 

U {{(f>(y),XT(y)) : (x,a,y) G E , d(y,F) = l}. 

UxeX\F : 

5(0(x),a,r(x)) = { (<f>(y), a, r(x)r(y)) : (x, a,y) <E E , d(y, F) = d(x, F) + l} 
U {(4>(y),T(y) = T(x)) : (x,a,y) G E , c% F) = F)} 
U { (4>(y), e) : (x, a, y) G F , d(y, F) = F) - l} 

This is a finite collection of transitions, since 0(-) and r(-) can take only finitely many 
different values. 

In view of the above explanations, A accepts L xo ^ yo . Also, when the graph X is deter- 
ministic, then so is A . □ 

Before proving a converse of Theorem 14.21 we first need some preliminaries, and start 
by recalling a fact proved in [10] and [TT], see also Woess [19] and Berstel and Boas- 
son PJ. 

(4.3) Lemma. If L 0j0 is context-free then there is a constant M such that for each cone 
C with respect to o, one has diam(9C) < M . 

(The diameter is of course taken with respect to the graph metric.) We shall see below 
how to deduce this, but it is good to know it in advance. 

A context-free grammar C = (V, S, P, S) is said to have Chomsky normal form (CNF), 
if (i) every production rule is of the form T h UU or T h a, where U, IJ G V (not 
necessariliy distinct), resp. a G S, and (ii) if e G L(C), then there is the rule S h e, and 
S is not contained in the right hand side of any production rule. 

With a slight deviation from [10J, we associate with each w — a\ ■ ■ • a n G L(C), n > 2 
a labelled (closed) polygon P(w) with length n + 1. As a directed graph, it has distinct 
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vertices to, t\, . . . , t n and labelled edges (tj-i, (ii,U), i = 1, . . . , n, plus the edge (to, S, t n ). 
A (diagonal) triangulation of P{w) is a plane triangulation of P{w) obtained by inserting 
only diagonals. Here, we specify those diagonals as oriented, labelled edges (ti,T,tj), 
where ti, tj are not neighbours in P(w) and T G V. Furthermore, we will never have two 
diagonals between the same pair of vertices of P(w). (If \w\ < 2 we consider P(w) itself 
triangulated.) The proof of the following Lemma may help to make the construction of 
[TO] (used for Cayley graphs of groups) more transparent. 

(4.4) Lemma. If C = (V, S, P, S) is in CNF and w = a x • • -a n G L(C) with n>2 then 
there is a diagonal triangulation of P(w) with the property that whenever (ti,T,tj) is a 
diagonal edge, then T occurs in a derivation S ==>- w, j — i > 2 and T ==^> a i+1 • • • a, . 

Proof. We start with a fixed derivation S ==>- w, and explain how to build up the triangles 
step by step. Suppose that T G V occurs in our derivation, and that we have a "sub- 
derivation" T h UU =>• aj + i ■ ■ • ak, where U, U G V. Then there is j G {i + 1, . . . , k — 1} 
such that U ==> a i+ i ■ ■ ■ clj and U => aj + i ■ ■ ■ a^. In this case, we draw a triangle with 
three oriented, labelled edges, namely the 'old' edge (ti,T,t k ) and the two 'new' edges 
(ti,U,tj) and (tj,U,t k ). 

If we have the derivation S ==^ a\ - ■ ■ a n , then it uses successive steps of the form 
T h UU with UU =^> Oj+i • • • a k as above. We work through these steps one after the 
other, starting with S h T{Ti, where T x a x . . .a k and T\ =>■ a fc+1 ■ • ■ a n . The first 
triangle has the 'old' edge {to,S,t n ) and the 'new' edges (t ,Ti,tk) and {tk,Ti,t n ). 

At any successive step, we take one of the 'new' edges (U,T,tk), where k — i > 2 and 
proceed as explained at the beginning, so that we add two 'new' edges that make up 
a triangle together with (ti,T,tk), which is then declared 'old'. We continue until all 
derivation steps of the form T h UU in our derivation S =^ w are exhausted. At this 
point, we have obtained a tiling of triangles that constitute a diagonal triangulation of 
its outer polygon, whose edges have the form (t ,S,t n ) and U^, ti) with U G V, 

i = 1, ... ,n. The only steps of our derivation that we have not yet considered are the 
terminal ones U h eij. Thus, we conclude by replacing the label U of (ti-i,Ui,U) by 
dj. □ 



The construction is best understood by considering an example: suppose our rightmost 
derivation is 



r x (T 2 (T3a 6 )) ^(thctoH)) 

=^ Ti(T2((r 4 a5)a 6 )) =^> Ti(T 2 ((a 4 a 5 )a 6 )) 

Ti(a 3 ((a 4 a 5 )a 6 )) =>> Ti(a 3 ((a 4 a5)a6)) 
==► (T 5 T 5 )(a3((a4a5)a 6 )) (T 5 a 2 )(a 3 ((a4a5)a 6 )) 
=>- (aia 2 )(a3((a4a 5 )a6)) 



(We have inserted the parentheses to make the rules that we used in each step more 
visible.) The associated triangulation is as follows. 



14 



T. Ceccherini-Silberstein and W. Woess 




Figure 2. 

The variables of the terminal rules T 5 h a\ , T 5 h ai , T 2 h , T 4 h 04 , T 4 h a 5 and 
T 3 h a 6 are not visible in this figure (but we might add them to the boundary edges). 
Apart from this, one can read the derivation S ==>- w from the diagonalization in a similar 
way as it can be read from the so-called derivation tree (see e.g. 0, §1.6] for the latter.) 

The following goes back to [10] in the case of (Cayley graphs of) finitely generated 
groups. 

(4.5) Lemma. Let C = (V,£,P,S) be in CNF and L(C) = L x>y (X), where X is a 
deterministic, symmetric graph. If w = a\---a n G L x>y (X) and (ti,T,tj) is a diagonal 
edge in a triangulation of P(w) as in Lemma 4-4 • then the vertices x = x ai ' ai and y = 
x ai—aj of X satisfy d(x,y) < m(T), where 



(4.6) d(x, y) < m(T) = mm{\w\ : w G Lt} • 

Proof. Since X is deterministic, Lemma [2.11 implies that n x (w) exists as the unique path 
with initial vertex x and label w. In particular, x and y lie on that path. Furthermore, 
we have y = y- a j+i-«n 

Now let v G Lrp with \v\ = m(T). Then by Lemma 14.41 T arises in a derivation 
S di • ■ • a{Taj + i ■ ■ ■ a n w. But then we also have S a x ■ ■ • a^aj+i ■ • • a n , 
a word in L X)V . By Lemma I2.1[ again using that X is symmetric and deterministic, 
x v = = y. Therefore, x and y are connected by a path with label v. Its length 

is m(T). □ 

(4.7) Theorem. Let (X,E,£) be a fully deterministic, symmetric graph with label alpha- 
bet S and root o. If L O Q is a context-free language, then X is a context-free graph with 
respect to o, and in particular, L O Q is deterministic. 



Context-free pairs of groups 



15 



Proof. There is a reduced grammar C = ( V , S , P , S) in CNF that generates L 0>0 . Each 
of the languages L T , T G V, is non-empty, only L s contains e, and we define 

(4.8) m = max{m(T) : T G V} , 

where m(T) is as in (14. 6p . 

Let C be a cone with respect to o such that k = d(o, dC) > in. 

Construction of D(C). We define D(C) as the subgraph of X induced by all vertices 
y G X with 

d(o, x) = d(o, y) + d(x, y) and d(x, y) < in for some x G dC . 

In particular, y lies on some geodesic path from o to dC. 

Now let Xi,x 2 G dC, and consider some path tt G n a ; 1(a . 2 (C f ) (i.e., it lies in C). Choose 
a geodesic path n\ from o to X\ and a geodesic path 7r 2 from x 2 to o. Then we can 
concatenate the three paths to a single path it\ixTXi G n oo . Its label is the word w = 
£(7r 1 )£(7r)£(7r 2 ) G L 00 . Set n = \w\ and write 

w = (ai • • • a fc ) (a fc+ i • • • a n „ fc ) (a n _jt + i • • • a n ) 

where the 3 pieces in the parentheses are (in order) £(tti), £(t^) and £(7r 2 ). The words 
£(ni), £(n) and £(7r 2 )S' are the labels of three consecutive arcs that fill the boundary of 
the polygon P(w). (To be precise, along the last edge of the 3 rd arc, we are reading the 
label S in the reversed direction.) By [TQj, Lemma 5], its triangulation has a triangle 
which meets each of those arcs. (It may also occur that one corner of the triangle meets 
two arcs.) Thus, there are i G {0, . . . , k} and %' G {k, . . . ,n — k} such that the vertices 
ti and ty of P(w) lie on that triangle. They correspond to the vertices y% = o ai "' ai and 
y' = o ai ' a i' of X. We either have %' — i < 1, or else a diagonal (ti, U,ty) is a side of our 
triangle. By Lemma [4.51 we get d(y\,y') < m(U) < m. Thus k < i' < d(o,y') < i + m, 
that is, % > k — in > 0. In particular, ti does not lie on the third arc. In the same way, 
there is j G {n — k, . . . , n — k + m} (and not larger) such that tj is a corner of our tiangle. 
This yields that there must be a "true" diagonal (ti, T, tj) of P(w). We set v± = a i+ i ■ ■ ■ a k 
and V2 = a n -k+i • • so that x\ = y^ 1 , and let y 2 = x 2 n_fc+1 aj ■ The points y\ and y 2 
are in D(C), and by Lemma I4.4[ T =^> v\£(ti)v2 ■ 

[It is here that we can see Lemma |4.3[ since we deduce that d(xi,X2) < 3m for all 
x\, x 2 G 9C] 

By Lemma [4.41 we also have 

S ) ' fl]^ • • • CLiTcLj^i • • • CL n , 

so that v <E L T implies a\ ■ ■ ■ diVdj+i ■ ■ ■ a n G L OQ and consequently v G L yi y , 2 , that is, 
2/i = Ife- 

We now insert into D(C) the additional labelled edge (y\,V\Tv<z, 2/2), whose label is the 
word v{Tvi G S*VS*. We insert all diagonals of the same type that can be obtained in 
the same way, and write D(C) for the resulting "edge-enrichment" of D(C). 

Subsuming, we have an edge (yx,V\Tv2,y2) in D(C) if and only if the following prop- 
erties hold. 

• \vi\ < m (i = I, 2) and T G V , 
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• the path with label V\ starting at y\ and ending at x\ = y^ 1 G dC is part of a 
geodesic from o to x\ , 

• the path with label v 2 starting at x 2 = y 2 V2 G dC and ending at y 2 is part of a 
geodesic from x 2 to o , and 

• there is a path % in C from xi to x 2 such that T V\t (tt)u 2 , 

• if T ==>- v G S* then t> is the label of a path in IIj, 1)3/2 . 

Now, there are only finitely many cones C with respect to o with d(dC, 6) < m. On the 
other hand, for all cones C with d(dC, o) > m, there is a bound on the number of vertices 
of D{C), as well as on the number of possible labels on its edges. In particular, there are 
only finitely many possible isomorphism types of the labelled graphs (D(C),dC) with 
"marked" boundary DC C D(C). 

We now suppose that C and C are two cones at distance > m from o, such that 
(D(C),dC) and (D(C'),dC) are ismorphic. We claim that C and C are isomorphic, 
and this will conclude the proof that there are only finitely many isomorphism types of 
cones with respect to o. 

Let (p '■ D(C) — > D(C) be an isomorphism with <p(dC) = dC, and cf>' its inverse 
mapping. We extend to a mapping from C to C, also denoted <fi. 

Claim 1. Let x G dC and v G S + such that the path n x (v) lies in C and meets dC only 
in its initial point x. Then the path 7T x >(v) lies in C and meets dC only in its initial 
point x' = <p(x) G dC . 

Proof. If a is the initial letter of v then (always using the notation of Definition 12. ip the 
first edge of 7r x (v) is (x,a,x a ). We now consider the path ir x '(v) with label v starting at 
x' G dC. We first claim that the latter lies in C and only its initial point x' is in dC. Let 
(x' , a, (x') a ) be the first edge of the path. Then (x') a cannot lie in D(C), since otherwise 
(x, a, x a ) = (0'(V), a, (/)'(x') a ) would be an edge in D(C), a contradiction. Thus, the path 
ir x i (v) goes at least initally into C \ DC. 

So now suppose that ir x '(v) ever returns to dC, and let ir' be its initial part up to the 
first return. Then v' = i(7r x '(v)) is an initial part of v with \v'\ > 2, and 7r' is a path 
within C from x[ = x' to x 2 = (x')'"' G dC. But then, by construction, D(C') must 
contain an edge (y^, ViTv 2 , y' 2 ) such that x\ = (y[) Vl , y 2 = (x' 2 ) V2 , and T ^> v\v'v 2 . Using 
the isomorphism <fi' : D(C') — > D(C), we set yi = 4>'(y'i), i = 1,2, and x 2 = (f>'(x 2 ) G dC. 
We have of course X\ = (t)'(x[). Now we must have the edge {yi,v\Tv 2 ,y 2 ) in D{C). But 
then V\v'v 2 G L yi) y 2 , and consequently v' G L X1)X2 , that is, G 9C. But this contradicts 
the fact that n x (v) meets dC only in its initial point. We conclude that also the path 
tt x '(v) lies in C and meets dC only in its inital point, and Claim 1 is verified. 

Now let z G C \ dC. Then there are x G dC and v G S + such that z = x v and the 
path n x (v) from x to z meets dC only in its initial point x. By Claim 1, the analogous 
statement holds for the path it x i{v) in C , where x' = (j)(x). The only choice is to define 
(f)(z) — z' — (x') v , which lies in C'\dC as required. We have to show that cj) is well-defined. 
This will follow from the next claim. 
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Claim 2. Let x\,x 2 G dC, v,w G S + such that the paths 71^ (i>) and vr x . 2 (w) lie in C, 
meet dC only in their initial points and end at the same point of C \ dC. Then, setting 
x\ = 0(xj), also 7Ta./ (u) and tt^/ (w) end at the same point of C \ dC. 

Proof. Let w~ x be the "inverse" of w, as defined in Definition 12 . 11 Then 
and i>u> _1 is the label of the path from x\ to x 2 that we obtain by first following ir xi (v) 
and then the "inverse" of n X2 (w). It lies entirely in C, and only its endpoints are in 
dC. By construction, D(C) has an edge (yi, v iTv 2 , y 2 ) such that y^ 1 = xi, x v 2 = y 2 and 
T =^=>- Vivw~ 1 v 2 - We set y[ = 4>(yi), i — 1,2. Then (y[,viTv 2 ,y 2 ) is an edge of D(C'). 
Therefore i>ii> , u; _1 i>2 G Ly' 1 , y ' 2 - But this implies that fzf -1 is the label of a path from x[ to 
X2, and we know from Claim 1 that it lies in C and has only its endpoints in dC. Thus 
(x^y = (x' 2 )- w ~ 1 = {x' 2 ) w , and Claim 2 is true. 

Thus, is well defined, and the same works of course also for <f>' by exchanging the 
roles of C and C . 

Claim 3. The map (ft : C — > C" is bijective. 

Proo/. We know that : <9C -> 3C" is bijective and that <j>{C \ dC) C C" \ Let 
z G C7\ 9C, and let x G 9C, t> G S + such that n x {y) is a path from x to z that intersects 
<9C only at the initial point. Setting x' = 4>(x), z' = 4>{z), we know from the construction 
of <p and Claim 1 that tt x / (v) is a path in C from x' to z' that meets dC only in its initial 
point. Now the way how <ft' is constructed yields that (f)'(z') = z. Therefore <fi' <p is the 
identity on C. Exchanging roles, we also get the (fxf)' is the identity on C . This proves 
Claim 3. 

It is now immediate from the construction that also preserves the edges and their 
labels, so that it is indeed an isomorphism between the labelled graphs C and C that 
sends dC to dC . □ 

[TTl Cor. 2.7] says that if a symmetric labelled graph is context-free with respect to 
one root o, then it is context-free with respect to any other vertex chosen as the root x. 
In view of Theorems 14.21 and 14.71 this is also obtained from the following, when the graph 
is fully deterministic. 

(4.9) Corollary. Let (X, E, £) be a fully deterministic, strongly connected graph with label 
alphabet S. If L O Q is context-free then L x>y is deterministic context-free for all x, y G X. 

Theorems 14.21 and 14.71 together with Lemma 13.11 also imply the following. 

(4.10) Corollary. Let G be a finitely generated group and K a subgroup. 

(a) The pair (G, K) is context-free if and only if for any symmetric ip : S — >• G, the 
Schreier graph X(G, K,ip) is a context-free graph. In this case, the language L(G, K,if>) 
is deterministic for every (not necessarily symmetric) semigroup presentation if) : S — > G. 

(b) If (G,K) is context-free, then also (G^g^Kg) is context-free for every g EG. 

Proof, (a) is clear. Regarding (b), for the Schreier graph X(G, K, ip), we have L(G, K, ip) = 
L 0)0 and L(G, g~ l Kg,ifj) = L X)X with x = Kg, g G G. Thus, the statement follows from 
Corollary EU □ 
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(4.11) Lemma. Let G be a finitely generated group and K, H be subgroups with K < H 
and [H : K] < oo. 

// (G, K) is context-free then also (G, H) is context-free. 

Proof. In the context-free graph X(G, K, i/j), consider the finite set of vertices F = {Kh : 
h G H}, containing the root vertex o = Ok = K. Then L{G,H,ip) = [J x€F L ox is a 
finite (disjoint) union of context-free languages. Therefore it is context-free by standard 
facts. □ 

(4.12) Remark. In terms of Schreier graphs, we have the mapping Kg > Hg which is 
a homomorphism of labelled graphs from X = X(G, K,ip) onto Y = X(G,K,ifj) which 
is finite-to-one. The lemma says that in this situation, if X is a context-free graph then 
so is Y. We do not see an easy direct proof of this fact in terms of graphs, the main 
problem being how the homomorphism X — > Y interacts with the isomorphisms between 
the cones of X with respect to the set F. On the other hand, reforomulating this in terms 
of the associated "path languages" with the help of theorems 14.21 and 14.71 it has become 
straightforward. 

The converse of Lemma 14.111 is not true, that is, when (G, H) is context-free and 
[H : K] < oo then (G, K) is not necessarily context- free. See Example 15.91 in the last 
section. However, we have the following. 

(4.13) Lemma. If K is a finite subgroup of G then (G,K) is context-free if and only if 
G is a context-free (i.e. virtually free) group. 

Proof. Fix S and -0. Let X = X(G,if)) be the associated Cayley graph of G, and Y = 
X(G, K, ifj). We let o be the root of Y, that is, o = K\& as an element of Y (a coset). The 
group K acts on X by automorphisms of that labelled graph. It leaves the set F = K 
(now as a set of vertices of X) invariant. The factor graph of X by this action is Y. Write 
7r for the factor mapping. It is |iT|-to-one. Each cone of X with respect to F is mapped 
onto a cone of Y with respect to o, and this mapping sends boundaries of cones of X 
to boundaries of cones of Y. By assumption, Y is a context-free graph. By Lemma 14.31 
there is an upper bound on the number of elements in the latter boundaries. Therefore 
there also is an upper bound on the number of elements of any of the boundaries of the 
cones of X with respect to F. 

Without going here into the details of the definition of the space of ends of X, we refer 
to the terminology of Thomassen and Woess [TTJ and note that the above implies that 
all ends of X are thin. But then, as proved in [17J, G must be a virtually free group. □ 



One should not tend to believe that in the situation of the last lemma, the Cayley 
graphs of G are quasi-isometric with the Schreier graphs of (G, K). As a simple counter- 
example, take for G the infinite dihedral group (a, b \ a 2 = b 2 ) and for K the 2-element 
subgroup generated by a. 
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5. Covers and Schreier graphs 

We assume again that (A, E, £) is symmetric and fully deterministic. Recall the invo- 
lution a i — > a -1 7^ a of S. A word in S* is called reduced if it contains no subword of the 
form aa -1 , where a G S. We write Ts for the set of all reduced words in X*. We can 
equip Ts with the structure of a labelled graph, whose edges are of the form 

(5.1) (v,a,w) and (w,a~ l ,v), where v, w G T s , a G £ , va = w . 

Thus, the terminal letter of v must be different from a~ l . Then Ts is fully deterministic, 
and it is a tree, that is, it has no closed path whose label is a (non-empty) reduced word. 
As the root of Ts, we choose the empty word e. Then Ts is the universal cover of A. 
Namely, if we choose (and fix) any vertex o G A as the root, then the mapping 

(5.2) $ : T s A , $(u;) = o w , 

is a covering map: it is a surjective homomorphism between labelled graphs which is a 
local isomorphism, that is, it is one-to-one between the sets of outgoing (resp. ingoing) 
edges of any element w G Ts and its image $(to). (Note that this allows the image of 
an edge to be a loop.) "Universal" means that it covers every other cover of A, but this 
is not very important for us. The property of w G to be reduced is equivalent with 
the fact that the path n (w) in A is non-backtracking, that is, it does not contain two 
consecutive edges which are the reversal of each other. 

We now realize that Ts is the standard Cayley graph of the free group Fs, where £ is 
the set of free generators together with their inverses. The group product is the following: 
if v, w G T s = F s , then v ■ w is obtained from the concatenated word vw by step after 
step deleting possible subwords of the form aa -1 that can arise from that concatenation. 
The group identity is e, and the inverse of w is w' 1 as at the end of Definition 12.11 With 
$ as in (O, let 

(5.3) K = K(A) = $ _1 (o) = {w G T s : tt (w) is a closed path from o to o in A } . 

Then, under the indentification = F^ , we clearly have that K is a subgroup of Fs • 
The following is known, see e.g. Lyndon and Schupp [9, Ch. Ill] or (our personal 
source) Imrich [7j. 

(5.4) Proposition. The graph X is the Schreier graph of the pair of groups (Fs,IK(A)) 
with respect to the semigroup presentation i/j given by ip(a) = a , a G S. 

In "0(a) = a, we interpret a simultaneously as a letter from the alphabet and as a 
generator of the free group. 

Thus, in reality the study of context-free pairs of groups is the same as the study of 
fully deterministic, symmetric context-free graphs under a different viewpoint. 

The same is not true without assuming symmetry. Indeed, given a semigroup presen- 
tation ip of G, for every a G £ there must be w a G S* such i/j(w a ) = ip{ a )~ l , the inverse 
in G. But then in the Schreier Graph X(G, K, ip), for any subgroup K of G, we have the 
following: if (x, a,y) G E then y Wa = x, that is, there is the oriented path from y to x with 
label w a . In a general fully deterministic graph this property does not necessarily hold, 
even if it has the additional property that for each a G X, there is presicely one incoming 
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edge with label a at every vertex. As an example, consider X = {x, y, z}, S = {a, b} and 
labelled edges (x, a, y), (x, b, y), (y, a, z), (y, b, x), (z, a, x), (z, b, z). 

We return to the situation of Proposition 15.41 As a subgroup of the free group, the 
group K(X) is itself free. There is a method for finding a set of free generators. First 
recall the notion of a spanning tree of X. This is a tree T, which as subgraph of X is 
obtained by deleting edges (but no vertices) of X. Every connected (non-oriented) graph 
has a spanning tree, for locally finite graphs it can be constructed inductively. Now let 
T be a spanning tree of X, and consider all edges of X that are not edges of T. They 
must come in pairs (e,e _1 ). For each pair, we choose one of the two partner edges, and 
we write E for the chosen (oriented) edges. For each e e E , we choose non-backtracking 
paths in T from o to e~ and from e + to o. Together with e (in the middle), they give rise 
to a non-backtracking path in X that starts and ends at o. Let w(e) be the label on that 
path. Then the following holds [9], [7j. 

(5.5) Proposition. As elements ofFs, the w(e), e G Eq, are free generators ofK.(X). 

(5.6) Corollary. Let G be a virtually free group and K a finitely generated subgroup. 
Then (G, K) is context-free. 

Proof. Let F = Fs be a free subgroup of G with finite index. Then K = K n F is a 
free subgroup of K with [K : K] < oo. Since K is finitely generated, also IK is finitely 
generated. In the Schreier graph X of (F, K) with respect to the standard labelling by S, 
choose a spanning tree and remaining set E of edges, as described above. Since all sets 
of free generators of K must have the same cardinality, E is finite. Thus, X is obtained 
by adding finitely many edges to a tree. If o is the root vertex of X and n is the largest 
distance between o and an endpoint of some edge in Eq, then every cone C of X whith 
d(dC,o) > n is a rooted, labelled tree that is isomorphic with one of the cones of T s . 
Thus, the Schreier graph, resp. (F, K) are context-free. It now follows from Proposition 
13.31 and Lemma [4.111 that also (G,K) is context-free. □ 



We remark here that one can always reduce the study of context-free pairs to free groups 
and their subgroups. Given (G,K), let F be a finitely generated free group that maps 
by a homomorphism onto G. Let K be the preimage of K under that homomorphism. 
Then clearly (G, K) is context-free if and only (F, K) has this property. (This reduction, 
however, is not very instructive.) 

Of course, there are context-free pairs with G free beyond the situation of Corollary 

ES 

(5.7) Example. Consider the free group F = (a,b \ ) and the subgroup K with the 
infinite set of free generators {a k b l ab~ l a~ k : k,l e Z, I ^ 0}. The associated Schreier 
graph with respect to {a ±x , b ±x } is the comb lattice. 
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(2,-2) 



Figure 3. 



Its vertex set is the set of integer points in the plane. The edges labelled by a are along 
the x-axis, from (k, 0) to (k + 1,0), and there is a loop with label a at each point (k, I) 
with 1^0. The edges labelled by b are all the upward edges of the grid, that is, all edges 
from (k, I) to (k, 1 + 1), where (k, I) e Z 2 . To these, we have to add the oppositely oriented 
edges whose labels are the respective inverses (in Figure 3, the oppositely oriented edges 
together with the corresponding labels are omitted for simplicity). The comb lattice is 
clearly a context-free graph (tree). 

We proceed giving some simple examples. It is very easy to see that context-freeness is 
not "transitive" in the following sense: if (G, H) and (H, K) are context-free (with C7, H 
finitely generated and K < H < G) then in general (C7, K) will not be context-free. 

(5.8) Example. Let G = Z 2 , H = Z x {0} = Z and K = {(0, 0)}. Then H (i.e., (H, K)) 
is context-free. Of course, this also holds for (G,H), whose Schreier graphs are just the 
Cayley graphs of Z. But Z 2 (i.e., (G,K)) is not context-free. 



This also shows that the reverse of Lemma 13.11 does not hold in general (while we know 
that it does hold when [G : H] < oo). Finally, we construct examples of three groups 
K < H < G, where (G, H) is context-free, [H : K] < oo, and (C7, K) is not context-free. 

(5.9) Example. We construct a family of fully deterministic, symmetric labelled graphs 
Xw , W C Z (non-empty), and one such graph Y, so that Y is the factor graph with 
respect to the action of a 2-element group of automorphisms of each of the labelled 
graphs Xy/. While Y will be a context-free graph, many of the graphs X\y in our family 
are not context-free. We then translate this back into the setting of pairs of groups. 
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The vertex set of Xw is Z x {0, 1}. The set of labels is £ = {a, b,a 1 ,b 1 }. The edges 
are as follows: 

((k,0),a,(k + l,0)) and ((k, 1), a, (k + 1, 1)) for all k G Z , 

(O,0)A + 1,0)) and ((Jfe, 1), 6, (jfe + 1, 1)) for all k G Z \ W , and 

((fc,0),6,(fc + l,l)) and ((&, 1),6, (k + 1,0)) for all jfc G VP . 

The reversed edges carry the respective inverse labels (in Figure 4, these reversed edges 
together with the corresponding labels are omitted for simplicity). Since W ^ 0, there is 
at least one of the "crosses" (pair of the third type of edges). Therefore X w is connected. 
In general, it does not have finitely many cone types, i.e., it is not context-free. For 
example, it is not context-free when W = {k(\k\ + 1) : k G Z} 



(-3,0) (-2,0) (-1,0) (0,0) (1,0) (2,0) (3,0) 




(-3,1) (-2,1) 6 (-1,1) 6 (0,1) (1,1) 6 (2,1) (3,1) 



Figure 4- 

For arbitrary W, the two-element group that exchanges each (k, 0) with (k, 1) acts on Xw 
by label preserving graph automorphisms. The factor graph Y (see Figure 5) has vertex 
set Z and edges 

(k,a, fc+1) and (k,b,k+l) for all k G Z , 

plus the associated reversed edges (in Figure 5, these edges together with the correspond- 
ing labels are omitted for simplicity). It is clearly a context-free graph. 

a a a a a a 

(-3,1) b (-2,1) b (_i,i) b (o,i) b ( i,i) b (2,1) 6 (3,1) 

Figure 5. 

Now let F = Fs be the free group (universal cover of Xyy and Y), and for given VF, 
let Kw be the fundamental group of Xw at the vertex (0,0). Furthermore, let IK be 
the fundamental group of Y at the vertex 0. Then it is straightforward that K w has 
index 2 in K. The mapping ip is the embedding of S into F s , as above. We then have 
Y = X(¥, K, ip) and X w = X(F, K w , ip), providing the required example. 
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